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Amendments to the Claims : 

This listing of claims will replace all prior versions, and listings, of claims in the 
application: 

Listing of Claims : 

1 . (Currently Amended) An apparatus for facilitating data clustering of speech 
and audio data , said apparatus comprising: 

an arrangement for obtaining input data; and 

an arrangement for creating a predetermined number of non-overlapping subsets 
of the input data; 

said arrangement for creating a predetermined number of non-overlapping subsets 
being adapted to split the input data recursively; 

said clustering being independent of any system . 

2. (Original) The apparatus according to Claim 1, wherein said arrangement for 
creating a predetermined number of non-overlapping subsets is adapted to initially split 
the input data into at least two sets of output data. 

3. (Original) The apparatus according to Claim 2, wherein said arrangement for 
creating a predetermined number of non-overlapping subsets is adapted to: 

split the at least two sets of output data recursively; and 
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repeat the recursive splitting of output data sets until the predetermined number of 
non-overlapping subsets is obtained. 

4. (Original) The apparatus according to Claim 2, wherein said arrangement for 
creating a predetermined number of nonroverlapping subsets is adapted to determine an 
eigenvector decomposition relating to the input data. 

5. (Original) The apparatus according to Claim 4, wherein said arrangement for 
creating a predetermined number of non-overlapping subsets is adapted to determine a 
vector of projection coefficients onto the set of eigenvectors in the eigenvector 
decomposition. 

6. (Currently Amended) The apparatus according to Claim 5, wherein said 
arrangement for creating a predetermined number of non-overlapping subsets is adapted 
to determine a probability density distribution relating to the vector of projection 
coefficients. 

7. (Original) The apparatus according to Claim 6, wherein said arrangement for 
creating a predetermined number of non-overlapping subsets is adapted to: 

assign at least one threshold relating to the probability density; and 

yield the at least two sets of output data based on the relation to the threshold of a 
value associated with a function relating to the projection coefficients. 

8. (Original) The apparatus according to Claim 7, wherein there are N-l 
thresholds, where N is the number of sets of output data to be yielded. 
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9. (Currently Amended) The apparatus according to Claim 8, wherein each 
threshold is a value of the function relating to the projection coefficients for which the 
probability d e nsity distribution equals m/N, where m is a number from 1 to N-l . 

10. (Original) The apparatus according to Claim 1, wherein the data clustering 
relates to the enrollment of target speakers in a speaker verification system. 

1 1 . (Currently Amended) A method of f acilitaUng data clustering of speech and 
audio data , said method comprising the steps of: 

obtaining input data; and 

creating a predetermined number of non-overlapping subsets of the input data; 

step of creating a predetermined number of non-overlapping subsets comprising splitting 
the input data recursively; 

said clustering being independent of any system . 

12. (Original) The method according to Claim 1 1, wherein said splitting step 
comprises initially splitting the input data into at least two sets of output data. 

13. (Original) The method according to Claim 12, wherein said splitting step 
comprises: 

splitting the at least two sets of output data recursively; and 
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repeating the recursive splitting of output data sets until the predetermined 
number of non-overlapping subsets is obtained, 

14. (Original) The method according to Claim 12, wherein said splitting step 
comprises determining an eigenvector decomposition relating to the input data. 

15. (Original) The method according to Claim 14, wherein said splitting step 
further comprises determining a vector of projection coefficients onto the set of 
eigenvectors in the eigenvector decomposition. 

16. (Currently Amended) The method according to Claim 15, wherein said 
splitting step further comprises determining a probability density distribution relating to 
the vector of projection coefficients. 

17. (Currently Amended) The method according to Claiml6, wherein said 
splitting step further comprises: 

assigning at least one threshold relating to the probability density distribution ; and 

yielding the at least two sets of output data based on the relation to the threshold 
of a value associated with a function relating to the projection coefficients. 

18. (Original) The method according to Claim 17 7 wherein there are N-l 
thresholds, where N is the number of sets of output data to be yielded. 
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19. (Currently Amended) The method according to Claim 18, wherein each 
threshold is a value of the function relating to the projection coefficients for which the 
probability d e n s ity distribution equals m/N, where m is a number from 1 to N-L 

20. (Original) The method according to Claim 1, wherein the data clustering 
relates to the enrollment of target speakers in a speaker verification system. 

21. (Currently Amended) A program storage device readable by machine, 

. tangibly embodying a program of instructions executable by the machine to perform 
method steps for facilitating date clustering of speech and audio data , said method 
comprising the steps of: 

obtaining input data; and 

creating a predetermined number of non-overlapping subsets of the input data; 

step of creating a predetermined number of non-overlapping subsets comprising 
splitting the input data recursively; 

said clustering being independent of any system . 
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